Statistical Mechanics of the Mixture of Experts
نویسندگان
چکیده
We study generalization capability of the mixture of experts learning from examples generated by another network with the same architecture. When the number of examples is smaller than a critical value, the network shows a symmetric phase where the role of the experts is not specialized. Upon crossing the critical point, the system undergoes a continuous phase transition to a symmetry breaking phase where the gating network partitions the input space effectively and each expert is assigned to an appropriate subspace. We also find that the mixture of experts with multiple level of hierarchy shows multiple phase transitions.
منابع مشابه
Designing and validating a Model for Integration of Professional Ethics Components with Technical Competencies for Industrial Mechanics Branch
Background: The emerging world of work requires the acquisition of a set of non-technical competencies with technical competencies in a career for sustainable employment. This paper aims at designing and validating a model for integrating the ethical components with technical competencies in curriculum based on competency in industrial mechanics’ branch. Method: The research approach is based ...
متن کاملA Study of Na/K Feldspar Solid Solution Using Statistical Mechanics
Thermal behavior of various solid feldspars are different, namely those of bivalent cations show no change in the distribution of Al and Si atoms, whereas feldspars of univalent cations become more disordered with raising temperature. In the latter case Al atoms migrate from the sites they occupe at low temperatures and interchange positions with the Si atoms. At high temperatures (but stil...
متن کاملExperts or an Ensemble? a Statistical Mechanics Perspective of Multiple Neural Network Approaches
In the framework of statistical physics, we studied the 'en-semble learning' and the 'mixture of experts', which are the typical re-alizations of the mutiple neural network approach. Generalization capabilities of the two methods are analyzed. We discuss the pro and con of the two approaches, and the possibility of uniied method combining the merit of two approaches.
متن کاملMixture of Experts for Persian handwritten word recognition
This paper presents the results of Persian handwritten word recognition based on Mixture of Experts technique. In the basic form of ME the problem space is automatically divided into several subspaces for the experts, and the outputs of experts are combined by a gating network. In our proposed model, we used Mixture of Experts Multi Layered Perceptrons with Momentum term, in the classification ...
متن کاملBiaxial Buckling and Bending of Smart Nanocomposite Plate Reinforced by CNTs using Extended Mixture Rule Approach
In this research, the buckling and bending behaviour of smart nanocomposite plate reinforced by single- walled carbon nanotubes (SWCNTs) under electro-magneto-mechanical loadings is studied. The extended mixture rule approach is used to determine the elastic properties of nanocomposite plate. Equilibrium equations of smart nanocomposite plate are derived using the Hamilton’s principle based on ...
متن کامل